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Abstract. A famous theorem of Kuratowski states that in a topological 
space, at most 14 distinct sets can be produced by repeatedly applying 
the operations of closure and complement to a given set. We re-examine 
this theorem in the setting of formal languages, where closure is either 
Kleene closure or positive closure. We classify languages according to the 
structure of the algebra they generate under iterations of complement 
and closure. We show that there are precisely 9 such algebras in the case 
of positive closure, and 12 in the case of Kleene closure. 



1 Introduction 

In 1922, Kuratowski proved that if S is any set in a topological space, then at 
most 14 distinct sets can be produced by repeatedly applying the operations 
of topological closure and complement to S [5,2]. Furthermore, there exist sets 
achieving this bound of 14 in many common topological spaces. 

There is a large and scattered literature on Kuratowski's theorem, most of 
which focuses on topological spaces; an admirable survey is the paper of Gardner 
and Jackson [3]. Our Theorem 1, Kuratowski's theorem in the setting of a general 
closure system, can be found in Hammer [4]. Our point of view most closely 
matches that of Peleg [6] , who briefly observed that Kleene and positive closure 
are closure operators, and hence Kuratowski's theorem holds for them. 

We reconsider this theorem in the context of formal languages, where closure 
is replaced by Kleene closure or positive closure. We describe all possible algebras 
of languages generated by a language under the operations of complement and 
closure. Our main results classify languages according to the structure of the 
algebras they generate, and give a language of each type (Theorems 5 and 7). 



2 Closure Operators and Closure Systems 

We recall the definitions and properties of closures in general. Let S be a set 
which we call the universal set. An operator □ operating on a set ICS will be 
denoted by X a . Then a mapping □ : 2 s — > 2 s is a closure operator if and only 
if it satisfies the following for all subsets X and Y of S: 



A C X a (□ is extensive); 

X CY implies X a C y D (□ is isotone); (1) 
x an = x a (□ is idempotent). 

A pair (S 1 , ) satisfying (1) is a closure system. The complement S \ X of a set 
X C S is denoted A - . The set A D is the closure of X. We say X is closed if 
X = A D . Also, AT is open if its complement is closed, and X is clopen if it is 
both open and closed. The interior of X (denoted X°) is defined to be A~ D ~. 
Note the duality between □ and o: 

x° = x- D - x a = x-°-. 

This duality also applies to (1), since we have 

X D X° (o is intensive); 

X CY implies X° C F° (o is isotone); (2) 
X°° = X° (o is idempotent). 

Moreover, it is equivalent to define (S, n ) via an interior operator satisfying (2). 
We now list some fundamental properties of closure systems; note the duality. 

Proposition 1. (a) The intersection of an arbitrary family of closed sets is 
closed, (b) The union of an arbitrary family of open sets is open. 

Proposition 2. For X G S, the following are identical: (a) X D ; (b) (~}{Y C S : 

Y D Xand Yis closed}; (c) {a S S : for all open Y C S, a G Y implies Y n X ^ 
®};(d)X-°~. 

Proposition 3. For X C S, the following are identical: (a) A ; (b) \J{Y C S : 

Y C Xand Yis open}; (c) {a E S : there exists an open Y C X, with a G Y}: 
(d) X~ a ~ . 

Proposition 4. Let X,Y G S. Then the following hold: 

(a) X a is closed. 

(b) (A U Y) D = (A D uy Q ) D . 

(c) {XC\Y) U C X a DY a . 

Proposition 5. Let X,Y G S. Then the following hold: 

(a) X° is open. 

(b) (A n Y)° = (a° nr°)°. 

(c) (AUF)° D X°UY°. 

A closure operator is topological if the union of a finite family of closed sets is 
closed (or, equivalently, if the intersection of a finite family of open sets is open). 
The closure operators which we discuss in the context of formal languages are 
not topological, and so we will not assume this property holds. 



3 Kuratowski's Complement-Closure Theorem 



Here, we state two versions of Kuratowski's theorem. The first [5] is equivalent 
to his original result generalized to an arbitrary closure system, not necessarily 
topological: 

Theorem 1. Let (S, □) be a closure system, and let ICS. Starting with X , 
apply the operations of closure and complement in any order, any number of 
times. Then at most 14 distinct sets are generated. Also, any ICS satisfies: 

x a-a-a-a =x n-a_ (3) 

A closure operator □ preserves openness if X a is open for all open sets X, or 
cquivalently, if Y° is closed for all closed sets Y. Hence if □ preserves openness, 
then X a ° and X° a are clopen for all sets X. We will see later that positive 
closure of languages preserves openness. 

Theorem 2. Let (S, □) be a closure system such that □ preserves openness, 
and let X C S. Starting with X, apply the operations of closure and complement 
in any order, any number of times. Then at most 10 distinct sets are generated. 
Also, any X C S satisfies: 

x o-a-a =x a-a-_ (4) 

Proof. Let T = {X, X D , X°, X° a , X D ° , X~, X a ~, X°~, X°°- , X D °-} be a 
family of sets, not necessarily all distinct. Clearly, T is closed under comple- 
ment. If □ preserves openness, then X° a and X a ° are both clopen. Hence X° a , 
X D ° , I° D_ , and X D °~ are all closed and thus equal to their closures. Equa- 
tion (4) then follows since X D ~ a ~ = X n ° . Moreover, X a and X°~ are also 
closed, leaving X~ and X a ~ as the only sets whose closures could possibly lie 
outside T. But X~ D = X°~ and X D ~ n = X D °- , so T must be closed under 
both complement and closure. Hence only the sets in T may be generated from 
X by repeatedly applying complement and closure, and the result follows. ■ 

In 1983, Peleg [6] defined a closure operator to be compact if it satisfies 
Equation 4. He showed that at most 10 different sets are generated if □ is 
compact, and proved that □ preserves openness if and only if it is compact. 
Thus Theorem 2 is a modified version of Peleg's result. 

4 Positive and Kleene Closures of Languages 

We deal now with closures in the setting of formal languages. Our universal set 
is S* , the set of all finite words over a finite non-empty alphabet S. We consider 
two closure operators: positive closure and Kleene closure. For LC£*,we define 
L-=S*\ L, L+ = U>! L\ and L* = LU L\ 



Proposition 6. Let L, M C E* . Then the following hold: 

(a) LCL+ and L C L* . 

(b) L C M impZies L+ C M+ and L* C M*. 

(c) L++ = L+ andL** = L* . 

Corollary 1. Positive closure and Kleene closure are both closure operators. 

We note, importantly, that the positive and Kleene closures are not topo- 
logical. As a counterexample, observe that (oa)+ U (aaa)+ C (aa U aaa)+, as a 5 
belongs to the right-hand side but not the left. Consequently, languages do not 
form a topology under positive or Kleene closure. 

A language is positive- closed if it is a closed set under positive closure. We 
define positive-open, Kleene- closed, and Kleene-open similarly. The positive in- 
terior of a language L is L® = L~ + ~; the Kleene interior is L® — L~*~ . 

Proposition 7. Let L C E* . The following are equivalent: 

(a) L is positive-closed. 

(b) LU {e} is Kleene- closed. 

(c) L = L+. 

(d) L = M+ for some MCE*. 

(e) For all u,v G L, we have uv E L. 

Proposition 8. Let L C E* . The following are equivalent: 

(a) L is positive-open. 

(b) L\ {e} is Kleene-open. 

(c) L = L®. 

(d) L = M® for some M C S* . 

(e) For all u, v E S* such that uv E L, we have u E L or v E L. 

In algebraic terms, a language L C S* is a semigroup if uv E L for all 
u,v E L. Proposition 7 therefore states that a language is positive-closed if and 
only if it is a semigroup. Additionally, L C S* is Kleene-closed if and only if it is 
a monoid — a semigroup containing e. We easily verify that if L is positive-closed, 
then so are L\{e} and LU{e}. Thus there is an obvious 2-to-l mapping between 
positive-closed and Kleene-closed languages — positive-closed languages may or 
may not contain e, and Kleene-closed languages must contain e. 

Since positive closure and Kleene closure are so similar, we shall restrict our 
attention to positive closure from this point onward. This will allow us to state 
our theorems more elegantly, as we will not need to worry about e. For the 
remainder of this article, a language is closed if it is positive-closed, open if it is 
positive-open, and clopen if it is both positive-closed and positive-open. 

Example 1. We first give an example of a clopen language. Let £ be an alphabet 
and let Si, £2 Q For w E S* , let |u>|i (respectively, |w|2) denote the number 
of distinct values of i for which w[i] E T,\ (respectively, w[i] E £2)- Suppose 
k > 0. Then {w E S* : \w\i < k\w\ 2 } is clopen. 



To prove this, we let L = {w G S* : \w\i < k\w\2}- Let u, v G L. Then 
\u\i < k\u\ 2 and \v\i < k\v\ 2 . But \uv\i = + \v\i < k\u\ 2 + k\v\ 2 = k\uv\ 2 , 
so uv G L, and thus L is closed. By a similar argument, we can prove that 
L~ = {w G S* : w|i > fc|w| 2 } is closed. Thus L is clopen. ■ 

Example 2. Next, we give several examples of open languages. A language L is 
prefix- closed if and only if for every w G L, each non-empty prefix of w is in L. 
We analogously define suffix- closed, subword- closed, and factor- closed languages. 
Here by subword, we mean an arbitrary subsequence, and by factor, we mean 
a contiguous subsequence. For any L C S* , if L is prefix-, suffix-, factor-, or 
subword-closed, then L is open. 

To verify the claim for prefix-closed languages, we show that L satisfies 
Proposition 8 (e). Let w G L and suppose w — uv. Then u G L if L is prefix- 
closed, so our characterization holds and L is open. The proof is similar if L is 
suffix-closed. Since factor- and subword-closed languages are also prefix-closed, 
the claim holds. ■ 

Example 3. Languages that are left ideals (have the form L = S*L), right 
ideals (L = LS*), two-sided ideals (L = S*LS*), or have the form L = 
Uoi— o„eL S*aiS* ■ ■ ■ E*a n S*, all satisfy L — L + , and so are positive closed. ■ 

In the 1970's, D. Forkes proved Equation 3 with the Kleene closure as □, and 
the first author then proved that Equation 4 holds when □ is positive closure. 
(They were both unaware of [5].) Peleg [6] proved this in greater generality over 
a wider class of operators. Here, we give a proof of an equivalent fact: positive 
closure preserves openness. 

Theorem 3. Let L C S* be open. Then L + is open. 

Proof. To show this, we use our characterization of open languages given in 
Proposition 8 (e). Let be such that uv G L + . Then let uv = x\x 2 ■ ■ ■ x n 

where each Xi G L. There must exist an index j, 1 < j < n, such that u = 
x\ ■ ■ -Xj-ix'j, v — x'jXj+i ■ ■ ■ x n , and x'jx" — Xj. But since x'jx" — xj G L and 
L is open, either x'j £ I or x'j G L. Thus either u = x\ ■ ■ - Xj-ix'j G L + or 
v = x'jxj + i ■ ■ ■ x n G L + . Hence L + is open by our characterization. ■ 

By arguments similar to those in the proof of Theorem 2, we may conclude: 

Corollary 2. Let L C S* . Then L +s> and L® + are clopen. Moreover, if L is 
open, then L + is clopen, and if L is closed, then L® is clopen. 

The converses of the above results are false; for example, there exist lan- 
guages such as {a, aaaa} which are not open but have clopen closures. We will 
discuss such possibilities extensively in the next section. For now, we give a 
characterization of the languages with clopen closures and clopen interiors. 

Theorem 4. Let L C S* . 

(a) L + is clopen iff there exists an open language M with L C M C L + . 



(b) L® is clopen iff there exists a closed language M with L D M D L® . 

Proof. We prove only (a); (b) can be proved using a similar argument. The 
forward direction of (a) is trivial since we can take M = L + . For the converse, 
we note that L C M implies L + C M + by isotonicity, and M C L + implies 
M + C L ++ = L + by isotonicity and idempotency. Thus M + = L + , and since 
M + is the closure of an open language, it is clopen and the result follows. ■ 

5 Kuratowski's Theorem for Languages 

For any language L, let A(L) be the family of all languages generated from L by 
complementation and positive closure. Since positive closure preserves openness, 
Theorem 2 implies that A(L) contains at most 10 languages. As we shall see, 
this upper bound is tight. Moreover, there are precisely 9 distinct finite algebras 
(A(L), + ~ ). Since the languages in A(L) must occur in complementary pairs, 
there can only exist algebras containing 2, 4, 6, 8, or 10 distinct languages. 
We will provide a list of conditions that classify languages according to the 
structure of (A(L)+ ~ ), and thus completely describe the circumstances under 
which \A(L)\ is equal to 2, 4, 6, 8, or 10. 

We will also explore Kleene closure, where there are subtle differences. Let 
D(L) be the family of all languages generated from L by complementation and 
Kleene closure. Kleene closure does not preserve openness, since Kleene-closed 
languages contain e and Kleene-open languages do not. Therefore we must fall 
back to Theorem 1, which implies that D(L) contains at most 14 languages, and 
we will show that this bound is also tight. There are precisely 12 distinct finite 
algebras (D(L)* ~ ). We shall describe these algebras by relating them to those 
in the positive case. 

In a sense, our results are the formal language analogue of topological results 
obtained by Chagrov [1] and discussed in [3] . Peleg [6] noted the tightness of the 
bounds of 10 and 14 in the positive and Kleene cases, but went no further. 

5.1 Structures of the Algebras with Positive Closure 

We may better understand the structure of A(L) by first analyzing a related 
algebra of languages. Let B(L) be the family of all languages generated from 
L by positive closure and positive interior, and let C(L) = {M : M~ G B(L)} 
be their complements. Recall that the closure of an open language is clopen 
and the interior of a closed language is clopen by Corollary 2. Since the closure 
and interior operators are idempotent on the clopen languages L + ® and 
it follows that B(L) = {L, L + , L +(S , L® , L e+ } . Of course, these five languages 
may not all be distinct; we will address this later. At the moment, we provide 
the following proposition, which demonstrates that it suffices to analyze the 
structure of B(L) to determine the structure of A(L). 

Proposition 9. Let L C S* . Then A(L) = B(L) U C(L), and the union is 
disjoint. 



Proof. Clearly A(L) D B(L) U C(L), since any language generated from L by 
closure, interior, and complement can be generated using only closure and com- 
plement by the identity L® = L~ + ~ . To prove the reverse inclusion, we let 
M G A(L). Then there is some string of symbols z G {+, — }* such that M = L z . 
We constuct a string z' G {+,—,©}* by starting with z and repeatedly re- 
placing all instances of — + by ©— and all instances of — © by H — , until no 
such replacements are possible. Since L~ + = L®~~ and L~® = L + ~, we have 
M = L z . However, in producing z', we effectively shuffle all complements to the 
right. Consequently, the operation performed by z' is a series of positive closures 
and interiors followed by an even or odd number of complements. Hence either 
M G B(L) or M G C(L), and thus A(L) = B(L) U C(L). 

We now prove that B(L) n C(L) = 0. We assume otherwise to obtain a 
contradiction; B(L) must then contain some complementary pair of languages M 
and M~. We note that L® C L® + by extensivity, L® C L C L + by intensivity 
and extensivity, and L® C L + ® by isotonicity, and hence L® C M for all 
A'/ G B{L). Thus for two languages in £?(L) to be complements, L® must be 
empty. Then L contains no strings of length f, and hence L + and L + ® do not 
cither. But then no language in B{L) contains a string of length f , and thus no 
pair of languages in B{L) are complements, and we have our contradiction. ■ 

Proposition 9 implies that = 2\B(L)\, and moreover that there is 

an exact l-to-2 correspondence between the languages in B(L) and A(L): each 
language in B(L) can be associated with itself and its complement. Hence the 
algebra (A(L), + ~ ) can be constructed by simply merging the two algebras 
(£>(L), + ,®) and (C(L), + ,® ) and adding the complement operator. Thus we 
have reduced the problem of describing all algebras (A(L), + ,~ ) to the simpler 
task of describing the algebras (B(L), + ,® ). Before we proceed, we need to ex- 
clude a possible case via the following: 

Lemma 1. Suppose L C £*. If L + and L® are both clopen, then L must be 
open or closed. 

Proof. Seeking a contradiction, we assume that both L + and L® are clopen but 
L is neither open nor closed. If L is not open, then L \ L® is non-empty. 

Let w be the shortest word in L \ L®. Consider M = L® U {w}. It must 
not be open, because if it were, we would have M C L® by Proposition 3. Then 
Proposition 8 (e) must fail to hold for some word in M. But it holds for all words 
in L® and thus must fail for w. Then there exists nonempty words x and y with 
xy = w, but x ^ M and y ^ M. Then neither x nor y is in L® . 

By our assumption that L + is open, the fact that w G L + implies that either 
x G L + or y G L + . Suppose without loss of generality that x G L + . Then x 
is the concatenation of a list of words from L; we write x = uiu 2 ■ ■ ■ u n with 
Ui G L for all I < i < n. Then \m\ < \x\ < \w\ for all i and thus G L® for all 
i by our definition of w as the shortest word in L \ L®. However, x is then the 
concatenation of a list of words from L® and is thus an element of L® + , which 
is L® since we assumed L® was closed. This is a contradiction since x ^ L®. ■ 



Finally, we characterize the 9 possible algebras (B(L), + ,® ). Table 1 classifies 
all languages according to the structures of the algebras they generate and gives 
an example of each type. Here, we briefly explain our analysis. Clearly B(L) = 
{L} if and only if L is clopen, giving case (1). If L is open but not closed, then 
B(L) = {L,L®} since L + must then be clopen. Similarly, if L is closed but 
not open, then B(L) = {L,L®}. These situations yield cases (2) and (3). We 
henceforth assume that L is neither open nor closed, and thus L, L®, and L + 
are all different. The remaining cases depend on the values of L® + and L + ® . 
Both must be clopen, so neither can equal L. Lemma 1 proves that L® and L + 
cannot both be clopen. If neither L® nor L + are clopen, then we have case (8) if 
L® + and L + ® are equal and case (9) if they are not. The remaining cases occur 
when one of L + and L® is clopen and the other is not. If L+ is clopen and L® 
is not, then we get case (4) if L® + = L + and case (6) otherwise. Analogously, 
if L® is clopen and L + is not, then we get case (5) if L + ® = L® and case (7) 
otherwise. 



Case 


Necessary and Sufficient Conditions 


\B(L)\ 


\A(L)\ 


Example 


Dual 


(1) 


L is clopen. 


1 


2 


a 


(1) 


(2) 


L is open but not closed. 


2 


4 


a 


(3) 


(3) 


L is closed but not open. 


2 


4 


aaa* 


(2) 


(4) 


L is neither open nor closed; 
L + is clopen and L® + = L + . 


3 


6 


a U aaa 


(5) 


(5) 


L is neither open nor closed; 
L® is clopen and L + ® = L®. 


3 


6 


aa 


(4) 


(6) 


L is neither open nor closed; 
L + is open but L® is not closed; 
L®+ ^L+. 


4 


8 


a U abaa 


(7) 


(7) 


L is neither open nor closed; 
L® is closed but L + is not open; 
L+®^L®. 


4 


8 


(a U 6)* \ (o U abaa) 


(6) 


(8) 


L is neither open nor closed; 

L® is not closed and L + is not open; 

L+® = L® + . 


4 


8 


aUbb 


(8) 


(9) 


L is neither open nor closed; 

L® is not closed and L + is not open; 

L+®^L® + . 


5 


10 


aUabUbb 


(9) 



Table 1. Classification of languages by the structure of (B(L), + ,® ) 



We see that if (B(L), + ® ) has algebraic structure (2), then (C(L) + ® ) 
has structure (3). Thus we shall say that case (3) is the dual of case (2). By 
examining the conditions under which each case holds, we can easily see that 
cases (4) and (5) are also duals, as are cases (6) and (7). Cases (1), (8), and (9) 
are self-dual. This notion is useful in constructing the algebra (A(L), + ,~ ); we 



connect an instance of (B(L), + ,® ) to its dual structure in the obvious way via 
the complement operator. Figure 1 gives an example of this for case (6). 




Fig. 1. Construction of A(L), Case (6): (a) (B(L), + , ffl ), Case (6); (b) (B(L) + , ffl ), 
Case (7), the dual of Case (6) obtained by interchanging + with ffi, and "open" with 
"closed"; (c) (C(L), + , ffi ), that is, (B(L),+ , ffi ), Case (7), with elements renamed as 
complements of those of Case (6); (d) A(L) constructed from B(L) and C(L). 

In summary, we have proven 

Theorem 5. Start with any language L, and apply the operators of positive 
closure and complement in any order, any number of times. Then at most 10 
distinct languages are generated, and this bound is optimal. Furthermore, Table 1 
classifies languages according to the algebra they generate and gives a language 
generating each algebra. 

In the unary case, we obtain the following: 

Theorem 6. Start with any unary language L, and apply the operators of pos- 
itive closure and complement in any order, any number of times. Then at most 
6 distinct languages are generated, and this bound is optimal. Furthermore, pre- 
cisely cases (1) through (5) in Table 1 are possible for a unary language. 

Proof. We assume that L C a* and consider the following two possibilities: 

Case (i): a G L. Then L + = aa* or L + = a*, both of which are clopen. 
Furthermore, a e L® and thus i®+ = L + . Hence one of cases (1), (2), or (4) 
must hold and so < 6. 



Case (ii): a ^ L. Then L® = or L® = {e}, both of which are clopen. 
Furthermore, a £ L + and thus L + ® = L + . Hence one of cases (1), (3), or (5) 
must hold and so | | < 6. 

Unary examples for cases (1) through (5) can be found in Table 1. ■ 

5.2 Structures of the Algebras with Kleene Closure 

As we did in the positive case, first we restrict ourselves to closure and interior. 
Let E(L) be the family of all languages generated from L by Kleene closure and 
Kleene interior, and let F(L) = {M : M~ e E(L)} be their complements. Our 
next results relate D(L) and E(L) to A(L) and B{L). Our discussion involves 
both closure operators, so we will be explicit about which closure properties we 
are invoking (although the word clopen will still mean positive-clopen). We first 
claim the following, which can be proven in the same manner as Proposition 9: 

Proposition 10. Let L C E* . Then D(L) — E(L) U F(L), and the union is 
disjoint. 

Next, we give a way of relating E(L) to B(L). We recall that L* = L + U {e} 
andi® = L®\{e}. Consequently, E(L) C \J MeB{L) {MU{e}, M\{e}}. We now 
know enough to explicitly determine D(L) in the following case: 

Proposition 11. Let L C S* be clopen. Then D(L) = {L U {e}, L \ {e}, L~ U 
{e},L~\{e}}. 

Proof. If L is clopen, then L = L + and L = L®. Thus L* = L U {e} and 
L® — L \ {e}. These values thus form E(L) and their complements form F(L). 
Our result then follows from Proposition 10. ■ 

Since the operations of positive closure and positive interior preserve the 
presence or absence of e in a language, we may also note that if e £ L, then all 
languages in B(L) contain e, and conversely if e ^ L, then no language in B(L) 
contains e. For M G E(L), we write <j){M) to denote either M U {e} or M \ {e}, 
whichever lies in B(L). We note that <j)(M), <f>(M U {e}), and (f>(M \ {e}) are 
equal. Moreover, we note that <f>(M*) = (f>(M) + and 4>(M®) = <j)(M)®\ (f> can 
therefore be thought of as a homomorphism from E(L) to B(L). Consequently, 
E(L) C {M : <j)(M) e B(L)}. We use this idea and the classifications of Table 1 
to determine all possible algebras (E(L)* ® ). As we shall see, there are precisely 
12 distinct algebras, each containing at most 14 elements. 

We have seen what happens in case (1) when L is clopen; 2 algebras are 
possible depending on whether e 6 L or not, and we refer to these as cases 
(la) and (lb) respectively. We next examine cases (2) and (3), in which L is 
not clopen but is open or closed. Suppose L is open but not clopen, and hence 
B(L) = {L,L+}. Then L* is clopen and thus E(L* = {L*,L* \ {e}}. Since 
E(L*) C E(L) we thus have {L, L*,L* \ {e}} C E(L) C {M : 4>{M) e {L,L+}}. 
Therefore, we have two cases; either one or both of L \ {e} and L U {e} may 
be in E(L), depending on whether or not L® = L. If e ^ L, then L® = L 



and thus E(L) = {L,L*,L* \ {e}}. If e e L, then L® = L \ {e} and thus 
E{L) = {L, L \ {e}, L*, L* \ {e}}. We refer to these situations as cases (2a) and 
(2b) respectively. 

Similar possibilities occurs when L is closed but not clopen. If e G L then 
E(L) = {L,L®,L® U {e}}. If e ^ L then L* = L U {e} and thus E(L) = 
{L,L U {e},L®L® U {e}}. We refer to these situations as cases (3a) and (3b) 
respectively. 

We now turn our attention to cases (4) through (9), when L is neither closed 
nor open. 

Lemma 2. Let L C U* be neither open nor closed. Then 

E(L) = {L} U {M U {e} : M G B(L) and M closed} 

U {M \{e} : M e B(L) and M open}. 

Proof. Clearly L G E(L). We claim that no other language M with (j){M) = L 
can be in E{L). If we suppose otherwise, then such an M must be generated 
by taking the Kleene closure or interior of some other language in E(L). This 
would imply that M is open or closed, which is impossible since 0(M) = L and 
L is neither open nor closed. 

For each remaining M G B(L) \ {L}, we wish to show that M U {e} G E(L) 
if and only if M is closed, and M \ {e} G E(L) if and only if M is open. Let 
M G B(L)\{L} be generated by some non-empty sequence S of positive closures 
and positive interiors. If we replace each positive closure by a Kleene closure and 
each positive interior by a Kleene interior, then we will obtain a sequence S" that 
generates some M' G E(L) with 4>{M') = M . Now M 1 contains e if and only if 
the last operation in S' was a Kleene closure. If M is closed, we may append a 
final positive closure to any such S to obtain one in which the last operation is a 
closure. Conversely, if there exists an S whose last operation is a closure, then M 
must be closed. Thus there exists an M' G E(L) containing e with 4>{M') = M 
if and only if M is closed. By a similar argument, there exists an M' G E(L) not 
containing e with 4>(M') = M if and only if M is open. The result follows. ■ 

Lemma 2 allows us to describe the structure of the algebra (E(L)* ,® ) in cases 
(4) through (9). Algebra E{L) contains M U{e} for all closed M in B(L), M\{e} 
for all open M in B(L), and both for all clopen M in B(L). 

We summarize the 12 distinct algebras in Table 2. The conditions are identical 
to those found in Table 1; the only differences lie in cases (1), (2), and (3), where 
the initial presence or absence of e can affect the structure of the algebra. 

We now summarize our results for the Kleene case: 

Theorem 7. Start with any language L, and apply the operators of Kleene clo- 
sure and complement in any order, any number of times. Then at most 14 dis- 
tinct languages are generated, and this bound is optimal. Furthermore, Table 2 
describes the 12 algebras generated by this process, classifies languages according 
to the algebra they generate, and gives a language generating each algebra. 
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Case 


Necessary and Sufficient Conditions 




\D(L)\ 


Example 


Dual 


(la) 


L is clopen; e £ L. 


2 


4 


a* 


(lb) 


(lb) 


L is clopen; e ^ L. 


2 


4 


a+ 


(la) 


(2a) 


L is open but not clopen; e £ L. 


3 


6 


oUe 


(3a) 


(2b) 


L is open but not clopen; e ^ L. 


4 


8 


a 


(3b) 


(3a) 


L is closed but not clopen; e ^ L. 


3 


6 


aaa* 


(2a) 


(3b) 


L is closed but not clopen; e G L. 


4 


8 


aaa* U 6 


(2b) 


(4) 


L is neither open nor closed; 
L + is clopen and L® + — L + . 


4 


8 


a U aaa 


(5) 


(5) 


L is neither open nor closed; 
L® is clopen and L + ® = L®. 


4 


8 


aa 


V*) 


(6) 


L is neither open nor closed; 

is open but Z/® is not closed; 
L®+#L+. 


6 


12 


a U afeaa 


(7) 


(7) 


L is neither open nor closed; 
L® is closed but L + is not open; 
L+® # L®. 


6 


12 


(aUd)*\ (aUfltofl) 


(6) 


(8) 


L is neither open nor closed; 

L® is not closed and L + is not open; 

L+® =L®+. 


5 


10 


aUbb 


(8) 


(9) 


L is neither open nor closed; 

L® is not closed and L + is not open; 

L+® ^ L®+. 


7 


14 


aUabUbb 


(9) 



Table 2. Classification of languages by the structure of (E(L)* ,® ) 
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